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[57] ABSTRACT 

A unified architecture for implementing the modified cosine 
transforms of various cosine modulated filter banks in audio 
compression standards comprises a permutation module and 
a transform computing module. A modified cosine transform 
is computed by a pre-permutation followed by a discrete 
cosine transform and an inverse modified cosine transform 
is computed by a discrete cosine transform followed by a 
post-permutation. The discrete cosine transform computed 
in the unified architecture is selected from the group of 
type-II, type-Ill and type-IV cosine transforms. The com- 
putation of an N point discrete cosine transform is decom- 
posed into a permutation- add stage, a sub -transform stage 
for computing two N/2 point discrete cosine transforms 
selected from the same group, and a combination stage. The 
architecture results in good regularity and general applica- 
bility as well as reduces complexity. 

22 Claims, 7 Drawing Sheets 
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UNIFIED RECURSIVE DECOMPOSITION Recently, Kok has developed the fast algorithm for type-II 

ARCHITECTURE FOR COSINE DCT which can recursively decompose one type-II DCT 

MODULATED FILTER BANKS with length N into two type-II DCTs with length (N/2) (see 

"Fast algorithm for computing discrete cosine transform," 
FIELD OF THE INVENTION 5 IEEE Xrans 0Q Signal p or cess., Vol. 45, No. 3, March 1997, 

The present invention relates to cosine modulated filter PP- 757-760). The decomposition from one DCT into two 
banks (CMFBs) in current audio compression standards, and DCTs leads lo tne merit m modularity and regularity, 
more specifically to the architecture of implementing 
CMFBs and the fast computation of various modified cosine 
transforms (MCTs) used in CMFBs. 10 This invention has been made to overcome the above 



SUMMARY OF THE INVENTION 



BACKGROUND OF THE INVENTION 



mentioned drawbacks and inefficiency in the implementa- 
tion and computation of various CMFBs in the current audio 

In current audio coding standards such as MPEG-I Layers standards. The primary object of this invention is to provide 

1-3, MPEG-II Layers 1-4, MPEG-I V, and AC-3, cosine a unified architecture which leads all MCTs in audio com- 

modulated filter banks (CMFBs) have been widely adopted 15 pression standards into two modules comprising a pre-(or 

to transform an audio sequence from time domain to trans- post-) permutation and a DCT. 

form domain or subband domain for compression. However, According to this invention, various modified cosine 

all the CMFBs' formulae vary with not only the different transforms used in a time-domain aliasing cancellation 

standards but also the standard layers, block length, and (TDAC) filter bank, a variant of the TDAC (TD AC- variant) 

either encoder or decoder. For real-time applications, these 20 fiUer bank an(J a polypnase filter bank can be divided as two 

various forms need to be individually designed and tuned for modules comprising a permutation module and a discrete 

precision, complexity, and memory movements. cosine t rans f 0 rm module. The forward transform can be 

FIG. 1 illustrate the structure of CMFBs in an audio computed by a pre-permutation and a DCT while the inverse 

encoder and decoder. As shown in FIG. 1, the process of transform can be computed by a DCT and a post- 

CMFBs comprising two steps, i.e., the window-and- 25 permutation. The DCT can be either type-II, type-Ill or 

overlapping addition (WOA) and the modified cosine trans- type-IV. 

form (MCT). The WOA is to perform a windowing multi- Another object of this invention is to provide a fast 

plication and addition with overlapping audio blocks. computation method that is applicable to the three types of 

The complexity of this step is O(k) per audio sample, 3Q DCT, and a common computing vehicle for the three DCT 

where k depends on the overlapping factors of the forms. For types. Through the same computing vehicle, the software 

example, the factor k is 16 for the MPEG-I Layers 2 and is modules or hardware modules can be developed for all the 

2 for the AC-3. The second step, MCT, has a complexity audio compression standards. 

O(W) per audio sample, where W is the windowing length To achieve low ^fc^ this invention follows a direct 

and is quite different for various CMFBs. The range of Wis 35 compu tation approach. Recursive decomposition is the 

from 36 for MPEG-I Layer 3 to 4096 for the MPEG-IV. For yehide adopted in developing fast methods for sinusoidal 

WOA, direct implementation has been generally adopted lrari sforms such as the discrete Fourier transform and the 

and the design is straightforward. On the contrary, the discrete co ^ nt transform . mve ntion presents a recur- 

complexity of the MCT is high, and fast approaches have sive de compos ition mechanism for the decomposition of 

been developed based on similar concepts developed for the 4Q Qne type of DCT imo either type . n) type . TII) or type . IV 

fast Fourier transform. . . . . „ . , , 

. , i , , Accordingly, it is a further object of this invention to 

It has been widely known that developing fast approaches ide a fast algorilhm which recursively decomposes a 

like the fast Fourier transform and the fast cosine transform of DCT ^ k fa N ^ Qlher of DCTs with 

needs to consider the tradeoff between arithmetic length N/2 

complexity, regularity, modularity, and numerical precision. A , w , , , , , . c , . 

HeDce.it is always a critical issue for designing hardware or 45 . 11 has b u een demonstrated thai the architecture of this 

software for the fast MCTs. u,veDU , on •»? 8°° d features m regularity, complexity, and 

^ e . , . general applicability to all MCTs in audio coding standards. 

There have been many fast computing mechamsms devel- ^ aoa] ^ aIso shows , hat , he t mven . 

oped for the d^crete cosine transform (DCT). These mecha- ^ has a lexit malcM lhat of lhe well . known DCT 

nisms are developed for different transform length and 50 aoDroacn 

different DCT types. On the audio coding, the radix 2 DCT * J t „ c , 

is the main considering length. The developing of the 0ther objects and benefits of the invention will become 

radix-2 fast DCT mechanisms can be classified into two a PP arent ^ om * e de ! ailed description which follows here- 

approaches including indirect computation of the DCT inafte / when taken 10 conjunction with drawing figures 

through the fast Fourier transform or the fast Hartley 5S wmch accompany it. 

transform, and direct computation of the DCT through BRIEF DESCRIPTION OF THE DRAWINGS 
matrix factorization or recursive decomposition. 

However, these two approaches have some disadvantages. FIG - 1 illustrates the general process of cosine modulated 

The first approach needs additional complexity in mapping filter banks in an audl ° encoder and a decoder. 

DCTs into other transforms while the second approach in 6 o FIG * 2 illustrates the representation of MCTs into permu- 

general lacks the modularity and data regularity. tations and DCTs according to the present invention. 

As mentioned by Yun et al., "On the fixed-point-error FIG. 3 illustrates the formula of the three classes of the 

analysis of several fast DCT algorithms," IEEE Trans. CMFB and the correspondence with various audio-coding 

Circuits Syst. Video Technol., Vol. 3, February 1993, pp. standards. 

27-41, the modularity and the regularity are essential for 65 FIG. 4 illustrates the decomposition of one 8-point type-II 

designing hardware and generalizing to higher order trans- DCT into one 4-poinl type-II DCT and one 4-point type-IV 

form. DCT according to the present invention. 
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FIG. 5 illustrates the decomposition of one 8-point type- regularity, modularity and numerical precision of this inven- 

III DCT into one 4-point type- II I DCT and one 4-point tion are provided. 
type-IV DCT according to the present invention. 

FIG. 6 illustrates the decomposition of one 8-point type- Embodiments 

IV D ,^? ° nC ^ P ° im t 2 1,C " 111 DCT ^ 4 " POin, 5 1- The Unified Form for the CMFBs 
type-IV DCT accordmg to the present invention. 

rt ^ „ .„ , . , ... c , The MCTs used in current audio standards can be clas- 

FIG. 7 illustrates the arithmetic complexity of the . . , , .. . - nx 

approach according to the present invention and the existing f** mt0 according to the .three types of filter 

approaches for the radix-2 DCTs. banks: Ume - domain aliasin 8 cancellation (TDAC) filter 

10 bank, variant of the TDAC filter bank and the polyphase 

DETAILED DESCRIPTION OF THE fl^ ter bank- FIG. 3 illustrates the formula of the three classes 

PREFERRED EMBODIMENTS of the CMFBs and the correspondences to the current audio 

coding standards. 

As mentioned before, this invention leads the various c „ . .„ . „ , 

MCTs of CMFBs into two modules: a pre-(or post-) permu- 35 ™ C fo "f wmg dlUStrateS that aU the CM ™s ca n be 

tation module and a DCT module. FIG. 2 illustrates the represented as a pre- or post-permutation and a discrete 

representation of the MCT into a permutation and the DCT cosine lransform (DCT) as shown in FIG. 2. And the DCT 

according to this invention. In addition, the DCT derived tv P e can be one of me following- three types: 

from the MCTs can be one of the three types of DCTs Type-II DCT 

generally referred to as type- II, type-Ill, and type-IV. 20 

The permutation requires only straightforward operations ^ j n \ ^ 

but the computation of DCTs of various types greatly x * = L XiC0 \™ (2/+ m) ) fork = °- 1 N ' 1 - 

impacts the efficiency and performance of an audio encoder 
or decoder. To achieve low complexity, this invention fol- 
lows a direct computation approach that involves recursive 25 Type-Ill DCT 
decomposition of the DCTs. 

This invention uses a recursive decomposition method for w -i (2) 

computing various DCTs. The method of this invention x k = ^ ^cos^— (0(2* + 1)) for* = 0, 1 N-i. 

leads to two important features. The first feature is that the i=0 

approach has a data regularity which is a property of the fast 30 

Fourier transform but not a general property for the fast 

cosine transform. The regularity is important for the data lype-lv uci 

path design in VLSI chip design and the memory addressing 

in software programming. ^ / n x ®) 

r 6 6 35 X k = > Xicod — (2/ + 1X2* + 1) for * = 0, 1 N - 1. 

The second feature is that the fast method can be opti- V 4N 1 

mally implemented for all the MCTs in audio standards. 

Because the method recursively and regularly decomposes 

the long length transforms into short length ones through In the above equations (l)-(3)> tnere nave DeeD constant 
three types of the DCTs, the unrolling of the recursive terms in front of each formula. For example, the type-IV 
decomposition from length N into length 2 will be the DCT is 
interleaving of the three types of the DCTs. 

In other word, the fast method is applicable to all the three 
types of the DCT, and the computing vehicle for the three 
DCT types is the same. Hence, this invention demonstrates 45 
that all the various CMFBs in the audio coding standards 
lead to different pre-permutations or post-permutations but Because the constant term will not affect the derivation 
will have the same computing vehicle for the DCTs. result, the constant term 
Consequently, through the same computing vehicle, the 
software modules or hardware modules can be generally 50 , — 
developed for all these audio compression standards. / 1. 

Furthermore, this invention develops the decomposition " N 

through the interleaving of three types of DCT instead of the 

same type of DCTs in order to have the regularity and the ^ neg iected for simplification. 

modularity. Because the decomposition is the interleaving of 55 ™ TT . r , „ r %Mr ^r . r r ~ A ~^. u n , 
4 , / ~™ , j • r 1 1 , a. The Unified Form for the MCT in I DAC Filter Bank 

the three types of the DCTs, the fast method is applicable to 

all three types of the DCTs instead of just the type II as Kok First > method t0 transform the modified cosine trans- 
proposed. The general applicability is the key factor to form (MCT) in the TDAC filter bank into a permutation and 
develop the fast algorithm for the cosine modulated filter a type-IV DCT is illustrated. 

banks in the current audio standards. 60 As snown m FIG. 3, the forward and inverse MCTs of the 

In the following part of this description, how all the TDAC filter bank are respectively defined as 
CMFBs lead into permutations and discrete cosine trans- 
forms is illustrated, and the decomposition of the three types v' / * ( ■ N l ) (4) 
of the DCTs into other types of DCTs is demonstrated. In Xk = L XiC0 \w( 2i + 1 + 7J 0 * + 1} J 
addition, the preferred embodiments for the decomposition 65 

of type-II DCT and , type-Ill DCT and type-IV DCT are for* =0, 1 N/2-U 

shown. Moreover, the analysis of arithmetic complexity, 



x » = /Pz W^ (2/ + 1X2 * + 1) ) foTk = 0 ' 1 N ~ u 
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-continued -continued 

Xi = £ Aico^ — ^2/+ 1 + y) (2/: + ^) 88 Z ^*cos(— (2/+ 1X2* + 1)), for t = 0, 1 N-l 

for/ = 0,1 /V-l. 5 

Through (8) and (9), some fundamental to compute 

The constant term before each summation is again TDAC transform is set. 

neglected for simplification of the representation. Also note ^ 

that, unlike the general transform, these x ( . are in general not 10 N l 

equal to the x,- given the same X*. X ' t = £ (2;+ 1X2* + i)) + 

The process for transforming the MCT in TDAC filter 2N 

bank into the permutation and the type-IV DCT comprises NIA . X 

the following three steps: £ -^cos(— (2m + i)(2* + l)) 

Al) Extending the transform pair in (3) and (4) to a form 15 m=0 

which has length N along both indices i and k. _ y, 1 ^ , » ^ + ^ + * 

A2) Representing the extended transform with length N as ' *2N f 

a transform with length N, which is quite similar to the 

type-IV DCT. 20 
A3) Reducing the DCT-like transform with length N to 

the type-IV DCT with length (N/2). This is from the following form (10) 

In accomplishing the unified form, defining the following 
transform pair: „ , N (10) 

U; = - X 3/v , for / = 0, 1, ... , 1, 

25 < + ^r 4 

»=* < n f N\ ^ («) N N 



x * = z^ cos (2^( 2 '' +i+ ?) (2A+ 4 



i=0 

for* =0, 1, .... N- 1, 



«; = X. n , for /=-,- + 1 N - 1. 

,_ T 4 4 



N j ^ 30 The sequence x',- computed from (7) can be obtained from 

jej = - V x' k coI— yii + 1 + — 1(2* -t- 1)1 the sequence u,. computed from (9) through the following 

2 t-o ^ 2/vV 2} } permutation (11) for N is a multiple of 4. 

for i = 0, 1 N-l. 

1 3N (11) 

* = -a. a,, for/ = 0, 1..., 1 

35 J 2 ,+ t 4 

The relation between the extended transform and the 

TDAC transform is as follows: x- = -u 1N , for / = — , — + l ... , n - l 

' 2 4 4 

Rl) The sequence of the X'* in (6) is anti-symmetric in the 

sense that X'^-X 1 ^,^ if N is a multiple of4. 
For N is a multiple of four, relation Rl) is achieved by 40 Because the X* computed from (8) is anti-symmetric in 
representing X^.^ as the sense that X^^-X^ for k=0, 1, 2, . . . , N-l, this yields 

expression (12) 

^.^/f^^.l.^-l-*).!)) „ (12) 

45 ** = Z MiCO W 2i " + im + l) ) f ° r * * °* 1 N ' 1 



for * = 0 N-l 



50 



g Ui cos(^(2/+lX2A+l)) for* = 0,1 --1 

for*=^...yv-i 



R2) For N is a multiple of four, assuming that the 

sequence XV with length N is achieved by extending Re p resem ing type-IV DCT with length (N/2) according to 

the sequence X k with length (N/2), then sequence x ( - in ^) gives 

(5) is equivalent to that of x',- in (7), given formulas (5) 55 

and (7). H i (13) 

Relation R2) is achieved by substituting XV— X V_!_* for y _ V , cnJ n m + + nW«r * o i N . 

N is a multiple of four and k is from N/2 to N-l. * ~ L ^ + im + !) ) for * = °' 1 2 " 1 

Through relations Rl) and R2), the MCT transforms in (4) 

and (5) are computed through (6) and (7) respectively, 60 

In order to compute TDAC, the DCT-like transform first In order to compute the DCT-like transform in (8) and (9) 

is defined as follows. through type-IV DCT in (13), the following expressions are 

set to be the basis. 

N-i n (8) That is, the sequence X* in (8) for k-0, 1,2,... (N/2)-l 

x t = Yj "iwiifiW + J X2* + 1)), for * = 0 N-l, 65 ^ equivalent to the sequence Y k in (13) if 

»»o 

srur»y-i-i for i-0, 1, . . . , (N/2)-l (14) 
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Yi = £ * cos (^ (2 * + W + »)) for i = 0, 1 y - 



(15) 



X 4 ' = V*J«w(-^-(2i + lK2* + l)) 
for* = 0, 1 N/2- 1, 



3 = £*M^ (2 '' +1)(2 * + 1) I 
for / = 0, 1 



The second transform pair is 

X 2 t = £j<?cos( — (2/ + 1 + *X2* + 1)) 

<=0 

for* = 0, 1 N/2- 1 

5? = V X 4 2 co/-^-(2l + 1 + Af)(2* + 1)1 
for i=0,l W-l 



(16) 



(17) 



(18) 



(19) 



8 



For i=0, 1, 2, ... , (N/2)-l, the sequence u,. in (9) is 
equivalent to the sequence Y ; of type-IV DCT in (15) with 
s*-2X*, where expression (15) is as follows. 



That is, the sequence X* 2 in (18) is equivalent to X* 1 if 



Summarizing the foregoing description, the MCT in (4) is 30 
computed through the type-IV DCT in (13) with the input 
permutation through (10) and (14), and the inverse MCT in 
(5) is computed through the type-IV DCT in (15) with the 
output permutation in (11). 

b. The Unified Form for the Variant of TDAC Filter Banks 
Dolby AC-3 coder has adopted two variants of TDAC 
filter banks in order to provide perfect reconstruction prop- 
erty between different block size. As shown in FIG. 3, the 
first transform pair is defined as 20 



for/ = 0. 1 .... 1, 

2 

N N 
for /=-,- + 1 . 
2 2 



(20) 



15 



This is because 

x l s Z %-Nn C0 { — W + 1X2* + 1) - 

:-Kiit ^2N / 



Z *£+«/2»{;p-<2m + 1X2* + 1)] 

m=0 2A/ 

'■XL 



In addition, the sequence Xj 2 in (19) is equivalent to x/ 



25 



30 



for i = 0, 1.. 



N N 
fori= — , — + t . 
2 2 



(21) 



,N-\ 



35 



Summarizing the foregoing description, from expression 
(14), the MCT of the first TDAC-variant in (16) is computed 
directly through the type-IV DCT in (13) with the input 
40 permutation. 



(22) 



The process for transforming the TDAC-variant in (16) 
and (17) into the permutation and the type-IV DCT com- 
prises the following steps: 

Bl) Setting the relation between the transform pair in 

(16H17) and that in (8)-(9). 
B2) Setting the fundamental to derive the TDAC-variant 

in (16) and (17) through DCT-like transforms in (8) and 

(9) according to the equation of X^-X^.^j for k=N/2, 

N/2+1, . . . , N-l. 
B3) The DCT-like transform is computed through the 

type-IV DCT by setting the basis mentioned in earlier 

description. 

In step Bl), the relation is U-2X,- 1 , and in step B2), the 
anti-symmetric property of X* and the relation in step Bl) 
are the fundamental. 

Hence, the first form of the TDAC-variant transform is 
transformed into the permutation and the type-IV DCT. 

The following expression (20) and (21) illustrate the 
relation between the MCTof the TDAC-variant in (16X17) 
and that in (18}-(19). 



45 



50 



From two equivalence relations, one is sequence u, in (9) 
and sequence Y t of type-IV DCT in (15), the other is 
sequence u, in (9) and two times the sequence computed 
from (9), the inverse MCT of the first TDAC-variant in (17) 
is computed directly through the type-IV DCT in (15). 

From (14) and (20), the MCTof the second TDAC-variant 
in (18) is computed directly through the type-IV DCT in (13) 
with the input permutation in (20) and (22). 

From two equivalence relations, one is sequence u ; in (9) 
and sequence Y ( - of type-IV DCT in (15) and expression (21), 
55 the inverse MCT of the second TDAC-variant in (19) can be 
computed directly through the type-IV DCT in (15) through 
the output permutation in (21). 
c. The Unified Form for the Polyphase Filter Banks 
As shown in FIG. 3, the transform pair for the cosine 
modified transform in polyphase filter bank is defined as 



60 



65 



for* =0, 1 W/2 - I 



(23) 
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-continued 

"&« N) , (24) po=«o (32) 

*,•= > X k co4- \i+ - (2* + 1) ^ /V 

^ \N\ 4) 1 for/=l,...,y-l 

for / = 0, I .... N- 1 5 

Summarizing the above analysis, the MCT in (23) is 
To compute the MCT in (23) and the inverse MCT in (24), computed through the type-Ill DCT in (31) with the input 
the invention provides the following two transform formu- permutation through (27) and (32). And, the inverse MCT in 
las: 10 (24) is computed through the type-Il DCT in (29) with the 

output permutation in (28) and (30). 

x' k = J]tt t -cos(— (/)(2* + 1)) for* =0, l Njl- 1 From the preceding detailed description, the invention 

1=0 illustrates that all the various modified cosine modulated 

Nn _ x (26) 15 transforms used in TDAC, TDAC-variant, and polyphase 

Qi = Yj ^i'cos(— (i)(2Jt + 1)] far i = 0, l ... , ;v- l filter banks can be divided into two modules: permutation 

i=0 and the DCT. Especially, the forward transform can be 

represented as a pre-permutation and a DCT while the 

After having given (23) and (25), the sequence X\ 2Q to™** transform as a DCT and a post-permutation. The 

computed through (25) is equivalent to the sequence X k DCT can be tv P e " n » type-Ill, or type-IV. 

computed through (23) if 2. Decomposition for the Discrete Cosine transform 

The following provides a method to decompose a type of 

u i = * + * (27) DCT with length N into two of the three types of the DCT 

3N 25 with length N/2. 

for = °' 1 T~ 1; a. Decomposition for the Type-II DCT 

Uj = -Jt._3Af From (1), the kth coefficient of the type-II DCT for an 

3 ^ 3 ^ input sequence x,- with length N is 

f 0r /= _ + 1, ...,/V- 1 30 
4 4 

X k = £ *i-cos(— (2/ + IX*)) for * = 0 . . - 1 . 

Similarly, after having given (24) and (26) and X Jfc =X , Jt , 
the sequence x i computed through (24) can be permuted 

from the sequence u, computed through (26) with the 35 First, decomposing X k of the type-II DCT into even- 
following form indexed and odd-indexed form. The even -indexed output 

sequence is 

:? ( -**d lVW4 for (=0 . . . 3N/4-1 and +1 x+\ i™-w,-_3w/4 

/or (-3N/4, 

3N/4+1 . . . N-l (28) * i M (33) 

X 2k = Y Xicod — (2( + 1)(2*) 1 for k = 0 N/2-1. 

*Ht ^2N ' 
Acording to (1), the type-II DCT with length (N/2) is 1=0 

■r-i n N (29) Next, one type-II DCT and one type-IV DCT with length 

x > ' Yj x^^O** txo) for/ = 0, 1 --1 45 (N/2) are achieved by applying the symmetry and anti- 

* =0 symmetry properties of cosine function on the even- indexed 

output sequence and odd-indexed output sequence respec- 

Let X'j^x^ expression (29) gives the sequence u ( in (26) lively, 
from X,. through 50 Applying the symmetry property as 



u, = 0 forf=/V/2 (30) 
ui = X, for ( = 0.1 N/2-l 



(2 (N - l - 0 + ix*) = cos) -(2/ + 1)(A) 



(34) 



JV N rive* 
-Xf,„i for i = y + 1 , — + 2 ... t N - I ss &vcs 



A//2-I 



According to (2), the type-Ill DCT with length (N/2) is 



*2*= £ Ui+Jr«-i-,0c^-(2/+l)<*)), 



£_i (31) 60 which is a type-II DCT with input permutation. 

Xi = 2 77(0(2* + 1)) for * = 0, l - - 1 The odd-indexed output sequence is 

N-l ^ 

Xn+ 1 = Y^cosf — (2/+ 1)(2*+ 1)), for/ = 0 N/2- 

In addition, the sequence X k in (31) is equivalent to the 65 £0 2/v 

sequence X'* in (25) if the sequence x ( is computed from the 

sequence u ( through 
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Applying the anti-symmetry property as c. Decomposition for the Type-IV DCT 

Before proceeding with the decomposition for the Type- 

1 n ,^ 1 n . ^ (33) IV DCT, this invention first provides the following property, 

coj- ( 2, + ix» + 1) = -co^jj W - 1 - 0 + 1X2* + 1) . e aQ (N+1)xN iypeni D( P be simpl . ficd 

5 type-Ill DCT 

gives 

Nn ~ 1 { n N 7i N ~ l it 

Xit+i = £ Ui-* W -i-,-)co S (— (2i+lX2t + l)), Z*H™ (i ' X2A+1) ) = Z Jr H2N (0(2 * +1J ) 

i-0 j.-0 i=0 

10 

which s a type-IV DCT with input permutation From (3) the kth ooeflicieilt of ^ DCT for „ 

As we can see, from (34) and (35), a type-Hl DCT with ^ ^ ^ N is 

length N is decomposed into one type-II DCT and one v M ' & 

type-IV with length (N/2). FIG. 4 illustrates an embodiment 

of the decomposition of one 8-point type-II DCT into one is _ N ^ , * \ ^ 

4-point type-II DCT and one 4-point type-IV DCT according Xk ~L *' C0J W 2 ' + i)(2A + 1} J for * ' 0 - N ~ 1 

to the present invention. 

b. Decomposition for the Type-Ill DCT 

From (2), the kth coefficient of the type-Ill DCT for an 
input sequence x,- with length N is 2 o e . A 1 (41) 



^ t* \ (36) 



Since cosA = (cosM + B) + co^A - B)), 

2cosfl 

(40) can be represented as 
1 



X* = 



25 M^^ +l JJ 

First, both the input sequence x, and the output sequence 

of the type -I II DCT are separated. The input is separated into vw f * \ j n w 

even-indexed and odd-indexed form while the output is ^^jco^— (2A + i)(/)j + co^— (2^ + 1)(/+ Djj 
separated into the first half of the sequence and the second 
half of the sequence; that is, 30 

Next, separating input into even and odd terms yields 



** = £ Jfjico^-(0(2fc + l))+ ^ * 2 ,v t cos( — (2/ + l)(2*+l)) 



(42) 



for*=0, 1..., /V/2- 1 35 2cos(^.(2* + 1)) 



(38) 



/V/2-1 



-(2Jt + 1)(2( + 1) + 
AT 



for* =0, 1 N/2-1 



• ^ * 2l -cos £(2* + 1)(20 + ^ x^icos 

,V/2-l 

45 Z ^>iCos( — (2* + l)(2/ + 2) 



-(2A + 1)(2/+ 1) + 



Next, substituting cos^(/)^Jfc + ^ j + ljj = 

^(^Kl " 1 " k ) + l )) ^ Sel the four terms in (42) can be represented 

-co/£(2i+ M~ - 1 - *] + l)) into (38) yields Xk = +^-.)cos(^(2A + l)(i) + 

2cos(_(2* + l))Uo " 



T^H^'Wi" 1 -*)* 1 )) 

fbrt = 0...AT/2-l 60 ff 

Finally, applying the property of ^ jf;cos(— (i')(2Jt + 1)J = 

On the results of (37) and (39), a type-Ill DCT with length 
N is decomposed into one type-Ill DCT and one type-IV v/2-1 
DCT with length (N/2). FIG. 5 illustrates an embodiment of V* Xi coJ— (/X2A + l)) gives 

the decomposition of one 8-point type-Ill DCT into one 65 4u ' " 2/v 

4-point type-Ill DCT and one 4-point type-IV DCT accord- 
ing to the present invention. 
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-continued 



2cos( 



1 { n 
0* + i) Uo 



(43) 



+ 1)<0 + 



fe+^iXo^— (2*+l)(2/+ 



On the result of (43), a type-IV DCT with length N is 
decomposed into one type-IV DCT and one type- II I DCT 
with length (N/2). FIG. 6 illustrates an embodiment of the 
decomposition of one 8-point type-IV DCT into one 4-point 
type-Ill DCT and one 4-point type-IV DCT according to the 
present invention. 
3. Complexity Analysis 

From FIGS. 4-6, the arithmetic complexities for all three 
types of the DCT are 

DCT-Il(t^=A(N)+DCr-IVCN/2)+DCT-H(N/2), 
DCr-HI(N)-A(N)+DCr-IV(N/2)+DCT-III(N/2), 



and 



DCT-IV(NT>A(N-l)+M(N>+rxrr-rV(N/2>DCT-UI(N/2), 



10 



20 



40 



where DCT-II(N), DCT-III(N), and DCT-IV(N) are indi- 
vidually the arithmetic complexity of the type-II, type- II I, 
and type-IV DCT with length N; and A(u) and M(k) indicate 
the number of real additions and multiplications are ft and k, 
respectively. 

In addition, FIG. 7 illustrates the arithmetic complexity of 
the approach according to the present invention and the 
existing approaches for the radix-2 DCTs. As depicted in 
FIG. 7, the results illustrate that the present invention has a 
complexity matching with the well-known approaches. 

The decomposition method of the present invention 
through the interleaving of the three types of DCT instead of 
the same type of DCTs has the regularity and the modularity 
in addition to the low complexity. 

Furthermore, because the decomposition is the interleav- 
ing of the three types of the DCTs, the present invention is 
applicable to all the modified cosine transforms in the audio 
coding standards. 
4. Conclusion 

Variant forms of the modified cosine transforms (MCTs) 45 
have been widely used in different audio standards. The 
present invention has illustrated that all these MCTs can be 
led into two modules: a permutation and a discrete cosine 
transform. Specifically, the MCTs in encoders are accom- 
plished as an input permutation and a DCT while the MCTs 
in decoder a DCT and a post permutation. The employed 
DCTs are either type-II, type-Ill, or type IV 

The present invention has provided a new fast architecture 
for the above three types of the discrete cosine transform. 
The new approach has been developed with decomposition 
from one type of the DCT into type-II, type-Ill, or type-IV. 
The decomposition has features in decomposing one type of 
DCT into type-II, type-Ill, or type-IV The decomposition 
has been shown to have good features in regularity, 
complexity, and general applicability in all MCTs in audio 
coding standards. 

Although this invention has been described with a certain 
degree of particularity, it is to be understood that the present 
disclosure has been made by way of preferred embodiments 
only and that numerous changes in the detailed construction 
and combination as well as arrangement of parts may be 
restored to without departing from the spirit and scope of the 
invention as hereinafter set forth. 
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What is claimed is: 

1. A computation module for transforming audio data 
from time domain to transform domain for audio data 
compression, said computation module computing an M 
point discrete cosine transform of M input data points of said 
audio data, comprising: 

a permutation-add stage for permuting said input data 
points and generating a first group of M/2 permuted 
data points and a second group of M/2 permuted data 
points; 

a sub-transform stage for computing a first M/2 point 
discrete cosine transform of said first group of M/2 
permuted data points and a second M/2 point discrete 
cosine transform of said second group of M/2 permuted 
data points for generating a first group of M/2 trans- 
formed data points and a second group of M/2 trans- 
formed data points; and a combination stage for com- 
bining said first and second groups of transformed data 
points as M transformed data points of the M point 
discrete cosine transform. 

2. The computation module according to claim 1, wherein 
said permutation-add stage or said combination stage is a 
pass- through stage for passing data through without permu- 
tation or computation. 

3. The computation module according to claim 1, said M 
2 5 point discrete cosine transform and said first and second M/2 

point discrete cosine transforms being selected from the 
group of type-II, type-Ill and type-IV discrete cosine 
transforms, said type-II discrete cosine transform being 
expressed by 



x k = *< cos (^ (2/ + m) ) for * = 0i • 



35 said type-Ill discrete cosine transform being expressed by 



X k = £ Xico^(0(2* + 1)) for* = 0.1 N- 1, 



and said type-IV discrete cosine transform being expressed 
by 

N-l 

*k = Yj * ico iw i2i + 1)(2 * + 1} ) for * = °* 1 N ~ 1; 



wherein x i represents an input data point, X* represents a 
transformed data point and N is the number of input data 
points in each discrete cosine transform, 

4. The computation module according to claim 3, wherein 
said M point discrete transform is a type-II discrete cosine 
transform, said first M/2 point discrete transform is a type-II 
discrete cosine transform, and said second M/2 point dis- 
crete cosine transform is an M/2 point type-IV discrete 
cosine transform. 

5. The computation module according to claim 3, wherein 
said M point discrete transform is a type-Ill discrete cosine 
transform, said first M/2 point discrete transform is a type- II I 
discrete cosine transform, and said second M/2 point dis- 
crete cosine transform is an M/2 point type-IV discrete 
cosine transform. 

6. The computation module according to claim 3, wherein 
said M point discrete transform is a type-IV discrete cosine 
transform, said first M/2 point discrete transform is a type-II I 
discrete cosine transform, and said second M/2 point dis- 
crete cosine transform is an M/2 point type-IV discrete 
cosine transform. 
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7. A unified architecture for implementing a cosine modu- 
lated filter bank having a modified cosine transform for 
transforming audio data from time domain to transform 
domain for audio data compression, comprising: 

a pre-permutation module; and 

a transform computing module for computing a discrete 
cosine transform; 

wherein said modified cosine transform is decomposed 
into a pre-permutation executed by said pre- 
permutation module and a discrete cosine transform 
computed by said transform computing module, said 
discrete cosine transform being one selected from the 
group of type-II, type-Ill and type IV discrete cosine 
transforms, said type-II discrete cosine transform being 
expressed by 



said type-Ill discrete cosine transform being expressed 
by 

N-l 

X k - £ *iCOs( — (0(2* + 1)) for* = 0, 1 N - 1, 



and a combination stage for combining said first and 
second groups of transformed data points as M trans- 
formed data points of the M point discrete cosine 
transform. 

s 12. The unified architecture according to claim 11, 
wherein said permutation-add stage or said combination 
stage is a pass-through stage for passing data through 
without permutation or computation. 

13. A unified architecture for implementing a cosine 
modulated filter bank having an inverse modified cosine 

10 transform for inversely transforming audio data from trans- 
form domain to time domain for audio data compression, 
comprising: 

a transform computing module for computing a discrete 
cosine transform; and 

15 a post-permutation module; 

wherein said inverse modified cosine transform is decom- 
posed into a discrete cosine transform computed by 
said transform computing module and a post- 
permutation executed by said post-permutation 
module, said discrete cosine transform being one 

20 selected from the group of type-II, type-Ill and type IV 
discrete cosine transforms, said type-II discrete cosine 
transform being expressed by 

N-l n 

25 ~ £ *' cos ( — (2f ' + 1)(Jt) ) for k = °' 1 N - 1 ' 



and said type-IV discrete cosine transform being said type-IH discrete cosine transform being expressed 

expressed by by 



30 



K k = £ *;cos( — (2/ + 1X2* + lj) for * = 0, 1 N-U ** = X X ' tQ0 \w C0(2 * + for * °' 1 N ~ l * 



wherein x t represents an input data point of said audio 
data, X k represents a transformed data point and N is 
the number of input data points in each discrete cosine 
transform, 

8. The unified architecture according to claim 7, wherein 
said cosine modulated filter bank is a time-domain aliasing 
cancellation filter bank, said modified cosine transform is 
decomposed into a pre-permutation and a type-IV discrete 
cosine transform. 

9. The unified architecture according to claim 7, wherein 
said cosine modulated filter bank is a variant of a time- 
domain aliasing cancellation filter bank, said modified 
cosine transform is decomposed into a pre-permutation and 
a type-IV discrete cosine transform. 

10. The unified architecture according to claim 7, wherein 
said cosine modulated filter bank is a polyphase filter bank, 
said modified cosine transform is decomposed into a pre- 
permutation and a type-Ill discrete cosine transform. 

11. The unified architecture according to claim 7, said 
transform computing module further including a computa- 
tion module for computing an M point discrete cosine 
transform of M input data points, said computation module 
comprising: 

a permutation-add stage for permuting said input data 
points and generating a first group of M/2 permuted 
data points and a second group of M/2 permuted data 
points; 

a sub-transform stage for computing a first M/2 point 
discrete cosine transform of said first group of M/2 
permuted data points and a second M/2 point discrete 
cosine transform of said second group of M/2 permuted 
data points for generating a first group of M/2 trans- 
formed data points and a second group of M/2 trans- 
formed data points; 



35 and said type-IV discrete cosine transform being 
expressed by 

N-l n 

X k =T X;Co{—(2i + 1)(2* + 1)) for * = 0, 1 N - 1; 

40 '=° 

wherein x,- represents an input data point of said audio 
data, X k represents a transformed data point and N is 
the number of input data points in each discrete cosine 
45 transform. 

14. The unified architecture according to claim 13, 
wherein said cosine modulated filter bank is a time-domain 
aliasing cancellation filter bank, said inverse modified 
cosine transform is decomposed into a type-IV discrete 

50 cosine transform and a post -permutation. 

15. The unified architecture according to claim 13, 
wherein said cosine modulated filter bank is a variant of a 
time-domain aliasing cancellation filter bank, said inverse 
modified cosine transform is decomposed into a type-IV 
discrete cosine transform and a post -permutation. 

55 16. The unified architecture according to claim 13, 
wherein said cosine modulated filter bank is a polyphase 
filter bank, said inverse modified cosine transform is decom- 
posed into a type-Ill discrete cosine transform and a post 
permutation. 

17. The unified architecture according to claim 13, said 
60 transform computing module further including a computa- 
tion module for computing an M point discrete cosine 
transform of M input data points, said computation module 
comprising: 

a permutation- add stage for permuting said input data 
65 points and generating a first group of M/2 permuted 
data points and a second group of M/2 permuted data 
points; 



03/16/2004, EAST version: 1.4.1 



6,119,080 

17 18 

a sub-transform stage for computing a first M/2 point 

discrete cosine transform of said first group of M/2 n-i n 

permuted data points and a second M/2 point discrete = £* ; co^— (0<^ + l)) for* =0, l N-i t 

cosine transform of said second group of M/2 permuted 
data points for generating a first group of M/2 trans- 5 

formed data points and a second group of M/2 trans- and said type-IV discrete cosine transform being 

formed data points; expressed by 
and a combination stage for combining said first and 

second groups of transformed data points as M trans- N ^ n 

formed data points of the M point discrete cosine 10 ^ = (2/ + l)(2A + l)j for^ 0, l N-L 

transform. 

18. The unified architecture according to claim 17, 

wherein said permutation-add stage or said combination wherein x ( - represents an input data point of said audio 

stage is a pass-through stage for passing data through JS data> Xk represents a transformed data point and N is 

without permutation or computation. the numbe r of input data points in each discrete cosine 

19. A method for computing a modified cosine transform transform* 

for transforming audio data from time domain to transform jj* * • . f a 

, . c J-., • c computing the decomposed discrete cosine transform; and 

domain for audio data compression, comprising the steps of: . . 

, j-cj - t c • * performing the post-permutation, 

decomposing said modified cosine transform into a pre- ?n £.« A *l , c w ■ t <■ 

r * , ,. . - , j u 21. A method of computing an M point discrete cosine 

permutation and a discrete cosine transform selected t c c xx . t , r < . . * j* j * *■ 

c iL tt in j * it/ j- * transform of M input data points of audio data tor trans- 

from the group of type-II, type-Ill and type IV discrete c . , _,. , r ^ * < . r 

o r jr wr jr forming said audio data from time domain to transtorm 

cosine transforms, said type-II discrete cosine trans- , . ° f ,. , „ ■ • .u . r 

„ , . i. domain tor audio data compression, comprising the steps ot: 

form being expressed by r 1 ° 

permuting said M input data points and generating a first 

N _ ( 25 group of M/2 permuted data points and a second group 

Xk = Z XiQO ijN ai + 1)(Jt) ) iork = 0A N ~ u of M/2 permuted data points; 

i=0 computing a first M/2 point discrete cosine transform of 

said first group of M/2 permuted data points and a 

. . TTT , t c ,. , in second M/2 point discrete cosine transform of said 

said type-Ill discrete cosine transtorm being expressed 30 , cam/~> . j j . * * f 

Kif Jr & r second group of M/2 permuted data points tor gener- 
ating a first group of M/2 transformed data points and 
a second group of M/2 transformed data points; 

x k = ^*,cos(— -(/)(2Jt + l))forJt = 0, 1,..., n-u and combining said first and second groups of trans- 

i=o 2N 35 formed data points as M transformed data points of said 

M point discrete cosine transform. 
22. The method of computing an M point discrete cosine 

and said type-IV discrete cosine transform being transform of M input data points according to claim 21, said 

expressed by M point discrete cosine transform and said first and second 

40 M/2 point discrete cosine transforms being selected from the 

X, = V Wi» ♦ 1X2* ♦ 0) fcr * - 0. 1 N - 1; f™? ° f ^ ^ 

£o m/v n transforms, said type-II discrete cosine transform being 

expressed by 

wherein x,- represents an input data point of said audio 45 ^ f n . 

data, X k represents a transformed data point and N is Xk = L x ' co \2N (2i U m) > for * = (U N ~ u 

the number of input data points in each discrete cosine 
transform; 

performing the pre-permutation; and said type-Ill discrete cosine transform being expressed by 
computing the decomposed discrete cosine transform. 50 

20. A method for computing an inverse modified cosine ^ , n . 

transform for inversely transforming audio data from trans- = 2j *i ct H^(0<2* + D) fort = o. l N- 1, 

form domain to time domain for audio data compression, 
comprising the steps of: 

decomposing said inverse modified cosine transform into and said type-IV discrete cosine transform being expressed 

a post-permutation and a discrete cosine transform by 
selected from the group of type-II, type-Ill and type IV 

discrete cosine transforms, said type-II discrete cosine r*-i n 

transform being expressed by £n x * = £ x ' C0 \7Z {li ^ 1)(2 * + •>) for * = 0 ' 1 N ~ v > 

X k = yXiCos(--(2/+lXJt))forJt=0,l,...,W-l, , . v 

jp* *2/v ' wherein x,- represents an input data point, X* represents a 

transformed data point and N is the number of input data 

65 points in each discrete cosine transform. 



by 



N-l 



said type-Ill discrete cosine transform being expressed 
by 



* * * * # 
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